Analysis of Methods and Strategies for Diversity Based Genetation of Classifier Ensemble

نویسندگان

  • M. S. Joshi
  • V. Y. Kulkarni
چکیده

A classifier ensemble is a group of individual base classifiers. Each classifier is trained individually by modifying the given data set to achieve diversity. During the testing phase the results given by each classifier are collected to give the final result using a technique called as majority voting. Empirical results prove that diversity amongst the base classifiers improves the accuracy of the result. The diversity can be achieved either based on datasets used for training or based on attribute selection methods. In this paper we summarize the study of the ensemble generation methods. We also propose to test a different approach that will find the distance based diversity of randomly generated datasets before the ensemble generation which will not only optimize the construction process but will improve the efficiency also. The Random Forest (RF) is a widely used algorithm for generating ensemble of decision tree classifiers. During the training phase it uses bagging technique to create bootstrapped samples. The best way to test the model is using 10-fold cross validation. As the empirical results show that the diversity in the classifiers improve the overall accuracy of the ensemble, it is needed to verify how diverse the training samples are. This will verify the existence of the base classifier in the ensemble based on the nature of randomly sampled dataset. The base classifier we yield with this approach certainly maintains a higher diversity and the accuracy of the ensemble improves further with these diverse base classifiers. We aim to achieve higher diversity with less number of base classifiers retained in the model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classifier Ensemble Framework: a Diversity Based Approach

Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition,...

متن کامل

Fault Detection of Bearings Using a Rule-based Classifier Ensemble and Genetic Algorithm

This paper proposes a reduct construction method based on discernibility matrix simplification. The method works with genetic algorithm. To identify potential problems and prevent complete failure of bearings, a new method based on rule-based classifier ensemble is presented. Genetic algorithm is used for feature reduction. The generated rules of the reducts are used to build the candidate base...

متن کامل

Optimum Ensemble Classification for Fully Polarimetric SAR Data Using Global-Local Classification Approach

In this paper, a proposed ensemble classification for fully polarimetric synthetic aperture radar (PolSAR) data using a global-local classification approach is presented. In the first step, to perform the global classification, the training feature space is divided into a specified number of clusters. In the next step to carry out the local classification over each of these clusters, which cont...

متن کامل

Improving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering

Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...

متن کامل

A Preprocessing Technique to Investigate the Stability of Multi-Objective Heuristic Ensemble Classifiers

Background and Objectives: According to the random nature of heuristic algorithms, stability analysis of heuristic ensemble classifiers has particular importance. Methods: The novelty of this paper is using a statistical method consists of Plackett-Burman design, and Taguchi for the first time to specify not only important parameters, but also optimal levels for them. Minitab and Design Expert ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014